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Abstract 

Quantum strategies are introduced into evolutionary games. The agents using quantum strategies are regarded as in- 
vaders whose fraction generally is 1% of a population in contrast to the 50% defectors. In this paper, the evolution 
of strategies on networks is investigated in a defector-dominated population, when three networks (Regular Lat- 
tice, Newman- Watts small world network, scale-free network) are constructed and three games (Prisoners' Dilemma, 
Snowdrift, Stag-Hunt) are employed. As far as these three games are concerned, the results show that quantum strate- 
gies can always invade the population successfully. Comparing the three networks, we find that the regular lattice is 
most easily invaded by agents that adopt quantum strategies. However, for a scale-free network it can be invaded by 
agents adopting quantum strategies only if a hub is occupied by an agent with a quantum strategy or if the fraction of 
agents with quantum strategies in the population is significant. 
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1. Introduction 

Game theory has been widely applied in both social and biological fields, in order to describe interactions between 
agents. Recently, the evolution of behavior of agents in a population, in the framework of evolutionary games on 
graphs, has attracted much interdisciplinary attention. Nowak and May [[llSl firstly introduced the spatial Prisoner's 
Dilemma (PD) game, in which agents (players) occupy all vertices of a two-dimensional lattice and the edges represent 
neighbor relations between the corresponding agents. This pioneering work triggered an intensive investigation of 
spatial games and the PD game is a model frequently adopted by researchers QQl- It is known that the structure of 
the network is also a key factor in the evolution of behavior of agents. Later, a shift from evolutionary games on regular 
lattices to evolutionary games on complex networks was proposed Jstl, in particular on small world networks 10,00 
and on scale-free networks ||9, Meanwhile, other games, such as Snowdrift (SD) fl5l, Stag-Hunt (SH) 111, 



and Public Goods games, have produced interesting results 114. 15, 16ill7n . 

Surprisingly, the concept of evolutionary games has been extended to the microworld to describe interactions of 
biological molecules [ITs, T^, 13, 21 ], a domain where quantum mechanics defines the laws. Meanwhile, game theory 



is also generalized to the quantum regime, and a new area called quantum game theory has emerged from the field 
of quantum computation. In recent years, much interest has been focused on quantum game theory. For instance, 
Meyer's results |22] showed that if an agent in a penny flip game is allowed to implement quantum strategies, she/he 
can always defeat her/his opponent playing a classical strategy and can thus increase her/his expected payoff. Eisert 
et al. 12311 quantized the PD and demonstrated that it is possible to escape the dilemma when both players resort to 
quantum strategies. Marinatto et al. |24] found a unique equilibrium for the Battle of the Sexes game, when entangled 
strategies were allowed. Later, evolutionarily stable strategies in quantum games and an evolutionary quantum game 



were also studied by Iqbal et al. II25I1 and Kay et al. 112611 respectively. Moreover, quantum games have also been 



implemented using quantum computers ll27ll28L 12911 . For further background on quantum games, see 1130113 111 . 

In quantum game theory, agents are allowed to use quantum strategies from a quantum strategy set that is a much 
larger set than a classical one, i.e., a classical strategy set is only a subset of a quantum strategy set. This larger space 
offers a possibility for a diversity of agent behavior and allows new patterns to emerge. In this paper, we assume all 
agents in a population are quantum agents who can use quantum strategies to play games with their neighbors and 
make decisions. However, initially, only a few randomly selected agents, about 1% in the population, are assigned 
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quantum strategies, while the others are players with strategies taken from the classical strategy set. The fraction of 
defectors is about half of the population. This work discusses how quantum strategies spread in the population and 
how strategies evolve over repeated games played on networks. Therefore, three networks (Regular Lattice, Newman- 
Watts small world network, scale-free network) are constructed and three games (Prisoners' Dilemma, Snowdrift, 
Stag-Hunt) are employed. The games encapsulate agents' responses to different external stimuli, while those networks 
provide different environments for agents. It is worth noting that a quantum strategy is not a probabilistic sum of pure 
classical strategies (except under special conditions) and it also cannot be reduced to the pure classical strategies ll25ll . 

The rest of this paper is organized as follows: Section 2 briefly introduces some concepts of quantum computation 
and quantum games. Next, the model and the simulation setup is described in Section 3 and Section 4 respectively. 
In Section 5, results are demonstrated firstly. Later, the situation of strategies spreading on networks is discussed and 
the evolution of strategies is analyzed when different games are adopted. The conclusion is given in Section 6. 



2. Quantum Games 



Before introducing quantum games, we describe some basic concepts of quantum computation. In quantum 
computation, a qubit is the elementary unit, which is typically a microscopic system, such as a nuclear spin or a 
polarized photon, while the Boolean states and 1 are represented by a prescribed pair of normalized and mutually 
orthogonal quantum states labeled as {|0), |1)) to form a 'computational basis' 13211 . Furthermore, any pure state of the 



qubit can be written as a superposition state a\Q) + for some a and /3 satisfying |ap + \/3[^ = 1 13211 . Also, if any 
manipulations on qubits are needed, they have to be performed by unitary operations, which can be carried out by a 
quantum logic gate or a quantum circuit [32]. The most often used quantum gate is the Hadamard gate. If a qubit in 
state 1 0) or 1 1) is manipulated by it, the qubit will be in the following state 
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For further details, see 11321 13311 . 

In the following, we take the PD as an example for introducing quantum games. As is known, PD can be used to 
model many strategic phenomena in the real world and it has been widely applied in a number of scientific fields. In 
this symmetric game, each of two players has two available strategies. Cooperation (C) and Defection (D). Next, each 
of the two players chooses a strategy against the other's at the same time, but both sides do not know the opponent's 
strategy. Finally, each agent acquires a payoff, where the payoff matrix to the first agent can be written as 
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Accor ding to the conclusion in classical game theory, the strategy profile (D, D) is the unique Nash Equilibrium 
(NE) 11341 13511 . but unfortunately the strategy profile (C, C) is merely the best choice that is Pareto optimal 13611 . 
Hence, the dilemma is produced. 

However, Eisert et al. quantized the PD game and introduced an elegant scheme, the physical model of a quantum 
game, which is shown in Fig. fTI ll23il . According to their results, the dilemma in the classical counterpart can be 
escaped in a restricted strategic space fz^, when quantum strategies are used. In their model, at first two basis vectors 
{|C - 0), \D - 1)) in Hilbert space are assigned to the possible outcomes of the classical strategies, C = and D = 1 
respectively [23]. Then, suppose the initial state is \tJ/o) = J |00) before the game is played, where J is an entangling 
operator that is known to both players. For a 2 x 2 game, the entangUng operator J has form below [|37, 38 1 
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where w e [0, 7r/2] is a measure of entanglement of a game. When oj 
which the entangling operator can be written as 



7t/2, there is a maximally entangled game, in 
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Figure 1 : The block diagram for the Eisert's scheme. 



Next, each agent chooses a unitary operator F as a strategy from the two-parameter strategy space S 112311 
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where y e [0, n}, (p e [0, n/l]. Then, she/he operates it on the qubit that belongs to her/him, which makes the game in 
a state (Fi ® Y2)J\QQ}. In the end, before a projective measurement on the basis {|0), 1 1)) is carried out, the final state is 



|<A/) = f{Yi ® Y2)J 100). 
As such, the first agent's expected payoff is written as 

n(r,0) = R\{<Pfmf + 5K^/|0i)|2 + rK,A/UO>l' + mf\n}f. 
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3. The Model 

Assume there is an undirected network G{V, E) with nodes, in which V is the set of nodes and E is the set of 
links. Also, each node / e V is occupied by an agent and its neighbor j is any other agent such that there is a link 
between them, so the set of neighbors of an agent / can be defined as F, - {j\eij e E,j e V\i}. 

In this paper, three different networks will be constructed. They are a Regular Lattice (RL) with periodic boundary 
conditions, a Newman- Watts (NW) small world network i39l. l40ll and a Scale-Free (SF) network |41, 42]. When 



periodic boundary conditions are involved, they can guarantee each node in the regular lattice has four neighbors. 
In addition, for avoiding isolated nodes, the NW small world network is selected in our work instead of the Watts- 
Strogatz (WS) network. The NW network can be established in two steps |43]. At first, a regular lattice, with periodic 
boundary conditions, is constructed, and then links are added with probability pnw between any two randomly chosen 
nodes. Finally, the SF network is established according to the Barabasi-Albert model 114 111 whose algorithm consists 
of two steps, growth and preferential attachment. It can start with a small network of mo <sc all connected nodes, 
and then a new node with m < nio links will be added to the network. Its m links will be connected to m different 
nodes chosen with probabiUty Psf (0 which can be calculated as below. 



Psf(i) = 
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Here, k is the degree of a node. This procedure will be repeated many times till the number of nodes of the network is 

Initially, each agent on the network is assigned a strategy randomly from the set of strategies {C, D, H, Q]. Next, 
an agent / will play a 2 x2 entangled quantum game in turn with each one of its |r,| neighbors according to the physical 
model of a quantum game (Fig.[T]), where the symbol | ■ | is the cardinality of a set. Throughout the paper, all quantum 
games are maximally entangled games, if not otherwise explicitly stated. And then its expected payoff 11,^, j e F, can 
be calculated by Eq.|7] The agent's total payoff is obtained by accumulating all it receives = Zijer/ n,j. 

After that, it will choose a neighbor from its neighborhood randomly and imitate its strategy with probability 

( ^\f',^ , Fj > FiJ e F; 
Pi = " " , (9) 

^ 0, otherwise 
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where a is a constant that can be calculated as below according to different games 14511 . 
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After all the agents acquire their payoffs, their strategies are updated synchronously. This process will be repeated by 
a maximum number of 10^ generations and the fractions of agents with different strategies are obtained by averaging 
the last 1000 generations, which produces a result of evolution of strategies. The final result is obtained by averaging 
over at least 100 of these results. If strategies of all agents do not change for 500 consecutive generations, it is deemed 
that an equilibrium has been reached and the iterations are stopped. 



4. Simulation setup 

Assume a population of 50x50 agents are located at nodes of the above mentioned networks. For the NW network, 
the probability that links are added between any two randomly chosen nodes is pnw - 0.5, while for the SF network, 
the number of nodes of the initial core network is mo = 2 or 3 and the links of each new node are set at m = 2. 
Throughout all simulations, the network topology remains static. In this paper, we consider two sets of strategies: 

Case 1. There are three strategies in the set 5i = {C,D,H}. Initially, three strategies, C (Cooperation), D (Defec- 
tion), H (Hadamard), are assigned to agents randomly and the fractions of strategies are 49%, 50% and 1% 
respectively. Here, the unitary operators C and D have the forms below 



Case 2. There are four strategies in the set 5 2 = {C, D, H, Q}. Initially, four strategies, C, D, H and Q, are assigned to 
the population randomly and the fractions of strategies are 49%, 49%, 1% and 1% respectively. The strategy 
Q takes the form 

^4 -.)■ 

In these two cases, the quantum strategy H brings a miracle move ll23ll when an agent uses it against the other's 
classical strategy. Also, the quantum strategy profile (2, Q) is a new NE observed by Eisert et al. 12311 . when players 
choose their strategies from the strategy space S - Y{y, <p). 

Then, the PD, SD and SH games are played by all agents on RL, NW, and SF networks respectively. To be 
compatible with previous studies and without loss of generality, the payoff matrix of the PD game is chosen as = 1 
(Reward), T - b{l < b <2) (Temptation), P - Q (Punishment) and 5=0 (Sucker's payoff) satisfying the inequalities 
T > R > P > S; the payoff matrix of the SD game asT=b>l,R = b- c/2, S - b - c and P - Q satisfying 
T > R > S > P, and the cost-to-benefit ratio of mutual cooperation is defined as r = c/(2b - c) [ 14], where c = 1 and 
< r < 1; the payoff matrix of the SH game sls R - I, T - r (0 < r < I), S = -r and f = satisfying R > T > P > S . 

The payoffs under any strategy profiles can be calculated according to the model of a quantum game in Section 2. 
As is known, the strategy profiles {H, H) and (Q, Q) are Nash equilibria for PD, SD and SH games in Case 1 and Case 
2 respectively. However, it is worth noting that for the SH game there are two Nash equilibria, i.e., besides (H, H) or 
(2, 0, the other is (C, C). 



5. Results and Discussion 

Throughout all simulations, the agents with quantum strategies H and Q are regarded as invaders, so their fractions 
are restricted to 1 % of the whole population initially. Thus, our simulations focus on investigating how the quantum 
strategies invade the population and how strategies evolve on networks. In order to elucidate the affects of quantum 
strategies, three games are considered to observe the behavior of quantum strategies. Furthermore, the structure of a 
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network is also a key factor for the evolution of strategies, so three different networks (RL, NW and SF) are constructed 
to test their influences. 

For Case 1, the upper and median sub-figures of Fig. |2] show that the fractions of agents with three strategies on 
RL and NW networks are similar. Given our three games, the quantum strategy H becomes the Evolutionarily Stable 
Strategy (ESS) almost from the outset when the PD game is played, i.e., even if the temptation T is only a little larger 
than the reward R, the strategy H can dominate the network successfully. However, for SD and SH games, it is more 
difficult for the strategy H to be the ESS. When the SD game is employed, the strategy H is played by all agents on 
the network, only if the cost-to-benefit ratio r > 0.32 and r > 0.40 on RL and NW networks respectively. Before this 
value, almost no agents use the strategy H, while at that time the frequencies of the strategies C and D are similar to 
the case of a SD game without a quantum strategy. For the SH game, a coordination game, the strategy H invades 
the network successfully when r > 0.82 and r > 0.67 respectively on RL and NW networks. From above analysis, it 
can be said that the SH game and the structure of NW network is more advantageous for classical strategies than the 
quantum ones. 
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Figure 2: Fractions of agents with tliree strategies in Case 1 on different games and networks, (a) Prisoners' Dilemma, (b) Snowdrift, (c) Stag-Hunt. 
In tlie upper, median and lower sub-figures of (a),(b) and (c), games are played on RL, NW and SF networks respectively. 

The results of evolution of strategies in Case 2 are shown in Fig. [3] From Fig. [3] it can be seen that for the PD 
the strategy Q can invade the whole network from the outset on both the RL and NW networks, and the strategy Q 
becomes the dominate strategy over H in Case 1 when the SD and SH games are adopted, which is improved about 
10%, 8% (SD on RL and NW) and 30%, 15% (SH on RL and NW) respectively. In contrast to Case 1, it can be 
inferred that the strategy Q is more aggressive than the strategy H. 
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Figure 3: Fractions of agents with four strategies in Case 2 on different games and networks, (a) Prisoners' Dilemma, (b) Snowdrift, (c) Stag-Hunt. 
In the upper, median and lower sub-figures of (a),(b) and (c), games are played on RL, NW and SF networks respectively. 



On the other hand, the evolution of strategies on the SF network is more complex due to the features of the SF 
network, i.e., the power-law distribution of degrees of nodes in the network. In both Case 1 and 2, all strategies fluc- 
tuate no matter which game is played. However, it can be observed that the classical strategies decrease in proportion 
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with the increase of the variable T or r, while the quantum strategies act conversely. According to the features of the 
SF network, if few quantum strategies are played by some agents with small degrees, it will be harder for them to 
invade the network. This raises the question that if a quantum strategy is employed by a certain agent with one of 
the first three largest degrees, namely a hub node, how will the quantum strategies evolve? In the next simulations, 
the agent with the first largest degree will be compulsorily assigned a strategy H in Case 1 or g in Case 2 after each 
agent chooses a strategy. The results are shown in the uppermost sub-figure in Fig. |4] and Fig. |5j while the other 
two sub-figures in Fig. |4] and Fig. |5] are the results when the agent with the second or the third largest degree plays a 
strategy H or Q. This procedure is applied on the PD, SD and SH games, respectively. 

From Fig. |4] and Fig.|5] it can be seen that when the agent with the first largest degree plays a quantum strategy, 
the fluctuations in the results reduce significantly, which means that if the node with the largest degree is occupied by 
an agent with a quantum strategy, then the strategy spreads out more quickly on the SF network and the population is 
invaded more easily by the strategy. If the quantum strategy is assigned to an agent with the second or the third largest 
degree, the fluctuations also decrease, but it is not lower than that of the first one. As for the three games, the SH 
game makes the fluctuations of results lower than those on the PD and SD games regardless of degrees. Furthermore, 
comparing the two strategies H and Q, we can find that the value of the temptation T or the cost-to-benefit r when the 
strategy Q becomes dominated is smaller than that of the strategy ft according to Fig.|4]and Fig.|5] 
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Figure 4: Fractions of agents with different strategies in Case 1 on a SF network, when the agents with the first three iargest degrees piay the 
quantum strategy H. (a) Prisoners' Ditemma. (b) Snowdrift, (c) Stag-Hunt. In the upper, median and lower sub-figures of (a),(b) and (c), the agent 
with the first, second or third degree is assigned the quantum strategy H respectively. 
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Figure 5: Fractions of agents with different strategies in Case 2 on a SF network, when the agents with the first three largest degrees play the 
quantum strategy Q. (a) Prisoners' Dilemma, (b) Snowdrift, (c) Stag-Hunt. In the upper, median and lower sub-figures of (a),(b) and (c), the agent 
with the first, second or third degree is assigned the quantum strategy Q respectively. 



For Case 2, the main reason why quantum strategies do not invade the population from the outset is that initially 
the fractions of agents with quantum strategies are too low. If the fraction is increased, the quantum strategy Q will 
be able to dominate the network. In simulations, the fractions of the strategy D and H remain constant at 49% and 
1%, while the fractions of the other two strategies are adjusted. Further, the fraction of the quantum strategy Q is set 
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at 10%, 20% and 25% respectively and correspondingly the fraction of the strategy C is 40%, 30% and 25%. For 
the SF network, the agent occupying the node with the largest degree is assigned a quantum strategy. Fig. |6] exhibits 
the evolution of strategies when the fraction of the quantum strategy Q is 25%, where we can see that the quantum 
strategy Q can invade the population successfully on all networks and becomes the ESS from the outset. 




Figure 6: Fractions of agents with different strategies in Case 2 on three games and three networks, when the fraction of the quantum strategy Q is 
increased at 25%. (a) Prisoners' Dilemma, (b) Snowdrift, (c) Stag-Hunt. In the upper, median and lower sub-figures of (a),(b) and (c), games are 
played on RL, NW and SF networks respectively. 



However, for Case 1, the situation is more complex than that in Case 2. Besides the reason mentioned above, 
another major reason also prevents the quantum strategy H being spread on networks that is the strategy profile 
{H, H) is not Pareto optimal although it is a Nash equilibrium. Hence, even if the fraction of the quantum strategy H 
is increased to 25%, it cannot dominate on all networks from the outset, especially on the SF network, as is shown in 
Fig El in which similarly the fraction of the quantum strategy H is set at 25% and correspondingly the fraction of the 
strategy C is also 25%, while the fraction of the strategy D remains constantly at 50%. 
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Figure 7: Fractions of agents with different strategies in Case 1 on three games and three networks, when the fraction of the quantum strategy H is 
increased at 25%. (a) Prisoners' Dilemma, (b) Snowdrift, (c) Stag-Hunt. In the upper, median and lower sub-figures of (a),(b) and (c), games are 
played on RL, NW and SF networks respectively. 



6. Conclusions 

In summary, we investigate the evolution of strategies on networks when quantum strategies H and Q are employed 
as invaders. For the evolution of strategies, the structure of a network is a decisive factor and a game represents an 
agent's response to some external stimuli. So, we construct three networks and introduce three games in this paper 
to investigate the evolution of strategies on these networks in a defector dominated population when diff'erent games 
are employed. As far as two quantum strategies are concerned, the strategy Q is more aggressive than the other one 
regardless of Case 1 or Case 2, because it is not only a Nash equilibrium but also Pareto optimal. Considering three 
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networks, we find that the population on a RL network can be invaded most easily by quantum strategies without any 
small world effects (properties), namely short average path length and large clustering coefficient. On the contrary, 
in the SF network the power-law distribution of degrees makes the spread of quantum strategies more difficult and 
exacerbates the fluctuations of results when few quantum strategies happen to be played by some agents only with 
small degrees. If an agent with a quantum strategy occupies a hub, i.e. a node with the largest degree, the fluctuations 
reduce considerably. Furthermore, if the fractions of quantum strategies are increased significantly, they can dominate 
a network from the outset. 
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